A Centroid Based Text Categorization Method Using Mean Shift
نویسندگان
چکیده
منابع مشابه
Effect of term distributions on centroid-based text categorization
Most of traditional text categorization approaches utilize term frequency (tf) and inverse document frequency (idf) for representing importance of words and/or terms in classifying a text document. This paper describes an approach to apply term distributions, in addition to tf and idf, to improve performance of centroid-based text categorization. Three types of term distributions, called inter-...
متن کاملAn improved centroid classifier for text categorization
In the context of text categorization, Centroid Classifier has proved to be a simple and yet efficient method. However, it often suffers from the inductive bias or model misfit incurred by its assumption. In order to address this issue, we propose a novel batch-updated approach to enhance the performance of Centroid Classifier. The main idea behind this method is to take advantage of training e...
متن کاملGenetic Algorithm Based Text Categorization Using OLEX Method
The system describes new similarity-based genetic algorithm (GA) and thresholding Strategies (R&SCut variants). GA was designed to give appropriate weights to terms according to their semantic content and importance by using their co-occurrence information and the discriminating power values for similarity computation. After investigating the existing common thresholding strategies, design mult...
متن کاملMusic Genre Classification Using Text Categorization Method
Automatic music genre classification is one of the most challenging problems in music information retrieval and management of digital music database. In this paper, we propose a new method to classify music genres using text categorization methods. Differing from previous solutions which were mainly based on analysis on acoustic or symbolic audio signal, here we consider music as a text-like se...
متن کاملLarge margin DragPushing strategy for centroid text categorization
Among all conventional methods for text categorization, centroid classifier is a simple and efficient method. However it often suffers from inductive bias (or model misfit) incurred by its assumption. DragPushing is a very simple and yet efficient method to address this socalled inductive bias problem. However, DragPushing employs only one criterion, i.e., training-set error, as its objective f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Information and Computational Science
سال: 2013
ISSN: 1548-7741
DOI: 10.12733/jics20102921